fix(langchain): anthropic cache token count #2414
Conversation
...eninference-instrumentation-langchain/src/openinference/instrumentation/langchain/_tracer.py
| "cache_read_input_tokens" in obj | ||
| and isinstance(obj["cache_read_input_tokens"], int) | ||
| or "cache_creation_input_tokens" in obj | ||
| and isinstance(obj["cache_creation_input_tokens"], int) |
nit: Add explicit parentheses in the type guard for clarity
```python
return (
    "input_tokens" in obj
    and "output_tokens" in obj
    and isinstance(obj["input_tokens"], int)
    and isinstance(obj["output_tokens"], int)
    and (
        ("cache_read_input_tokens" in obj and isinstance(obj["cache_read_input_tokens"], int))
        or ("cache_creation_input_tokens" in obj and isinstance(obj["cache_creation_input_tokens"], int))
    )
)
```
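For context, a complete version of the guard with the suggested grouping might look like the sketch below. The function name matches the one this PR adds; the signature and `TypeGuard` return annotation are illustrative guesses, not the PR's exact typing.

```python
from typing import Any, Mapping

from typing_extensions import TypeGuard


def _is_raw_anthropic_usage_with_cache_read_or_write(
    obj: Mapping[str, Any],
) -> TypeGuard[Mapping[str, int]]:
    # Matches only Anthropic usage dicts that carry the standard
    # input/output counts plus at least one cache token field.
    return (
        "input_tokens" in obj
        and "output_tokens" in obj
        and isinstance(obj["input_tokens"], int)
        and isinstance(obj["output_tokens"], int)
        and (
            (
                "cache_read_input_tokens" in obj
                and isinstance(obj["cache_read_input_tokens"], int)
            )
            or (
                "cache_creation_input_tokens" in obj
                and isinstance(obj["cache_creation_input_tokens"], int)
            )
        )
    )
```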
```python
        )
    ):
        return
    keys: Sequence[str]
```
Bug: Anthropic Usage Metrics Disappear
Removing `input_tokens` and `output_tokens` from the generic token count extraction breaks basic Anthropic responses that include neither cache tokens nor `total_tokens`. Previously, responses with just `input_tokens` and `output_tokens` (the standard Anthropic format without caching) were captured by the first loop. Now they are only handled by `_is_raw_anthropic_usage_with_cache_read_or_write`, which requires cache tokens to be present, so basic Anthropic token counts are lost.
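Given the guard sketched above, a minimal illustration of the reported failure mode (the payload values are made up for the example):

```python
# A standard Anthropic usage payload with no cache fields.
usage = {"input_tokens": 22, "output_tokens": 9}

# The cache-specific guard rejects it...
assert not _is_raw_anthropic_usage_with_cache_read_or_write(usage)

# ...and if the generic input_tokens/output_tokens extraction no longer
# runs, no other path records these counts, so the span ends up without
# prompt/completion token attributes.
```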
resolves #2381
Note
Improve token usage extraction to handle Anthropic cache read/write and LangChain UsageMetadata (with Bedrock heuristics), update deps, and add targeted tests.
- `_tracer.py`:
  - Add `_is_lc_usage_metadata`/`_token_counts_from_lc_usage_metadata` to map `langchain_core.messages.ai.UsageMetadata`, including audio/reasoning and cache details with Bedrock-specific heuristics.
  - Add `_is_raw_anthropic_usage_with_cache_read_or_write`/`_token_counts_from_raw_anthropic_usage_with_cache_read_or_write` to handle Anthropic `cache_read_input_tokens`/`cache_creation_input_tokens` and emit detailed cache attributes.
  - Route both through `_token_counts`; remove the previous key-based Anthropic handling.
  - Update typing (`TypedDict`, `TypeGuard`, `UsageMetadata`).
- Tests:
  - Add `tests/test_token_counts.py` covering LC usage metadata and Anthropic cache scenarios.
  - Update `tests/test_instrumentor.py` Anthropic expectations (`LLM_TOKEN_COUNT_PROMPT` from `22` to `33`).
- Dependencies: bump `langchain_core` to `>= 0.3.9` (instruments) and pin `== 0.3.9` (type-check).
- CI: in `tox.ini`, add `uv pip list -v` to `commands_pre`.

Written by Cursor Bugbot for commit 30a6e95. This will update automatically on new commits.
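As a rough sketch of the Anthropic cache mapping described above (not the PR's actual code): summing cache tokens into the prompt total is inferred from the `22` to `33` test change, and the cache-detail attribute names are assumed from `openinference.semconv`.

```python
from typing import Any, Iterator, Mapping, Tuple

from openinference.semconv.trace import SpanAttributes


def _token_counts_from_raw_anthropic_usage_with_cache_read_or_write(
    obj: Mapping[str, Any],
) -> Iterator[Tuple[str, int]]:
    cache_read = obj.get("cache_read_input_tokens", 0)
    cache_write = obj.get("cache_creation_input_tokens", 0)
    # Anthropic reports cache tokens separately from input_tokens, so the
    # prompt total includes all three buckets.
    yield SpanAttributes.LLM_TOKEN_COUNT_PROMPT, obj["input_tokens"] + cache_read + cache_write
    yield SpanAttributes.LLM_TOKEN_COUNT_COMPLETION, obj["output_tokens"]
    if cache_read:
        yield SpanAttributes.LLM_TOKEN_COUNT_PROMPT_DETAILS_CACHE_READ, cache_read
    if cache_write:
        yield SpanAttributes.LLM_TOKEN_COUNT_PROMPT_DETAILS_CACHE_WRITE, cache_write
```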